Of the Dissertation Automatic Assessment of Non - Topical Properties of Text by Machine Learning Methods

نویسندگان

  • Ying Sun
  • YING SUN
  • Paul B. Kantor
چکیده

..............................................................................................ii Acknowledgements .................................................................................iv List of Tables ......................................................................................viii List of Figures ........................................................................................x Chapter 1: Introduction...................................................................................................1 Chapter 2: Non-Topical Qualitative Properties of Documents....................................7 2.1 User – Centered Studies....................................................................................... 8 2.2 Information – Centered Studies ......................................................................... 10 2.3 Summary of Properties ...................................................................................... 11 2.4 Assessment of Qualitative Properties ................................................................ 14 Chapter 3: Linguistic Features......................................................................................17 3.1 Stylistic Studies.................................................................................................. 17 3.1.1 Authorship Attribution Research ................................................................. 18 3.1.2 Genre Classification..................................................................................... 19 3.1.3 “Style” and Non-topical Qualitative Properties........................................... 20 3.2 Linguistic Features as Indicators ....................................................................... 21 Chapter 4: Automatic Classification Techniques........................................................24 4.1 Classification through Learning......................................................................... 25 4.2 Linear Regression .............................................................................................. 27 4.3 Logistic Regression............................................................................................ 28 4.4 Decision Tree Learning...................................................................................... 30 4.5 Support Vector Machines (SVMs)..................................................................... 34 4.6 Applications ....................................................................................................... 35 Chapter 5: Research Problems......................................................................................38 Chapter 6: Methodology, Experimental Design and Evaluation Measures..............41 6.1 Document Corpora............................................................................................. 41 6.2 Non-Topical Qualitative Property Judgments ................................................... 43

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Using Machine Learning Algorithms for Automatic Cyber Bullying Detection in Arabic Social Media

Social media allows people interact to express their thoughts or feelings about different subjects. However, some of users may write offensive twits to other via social media which known as cyber bullying. Successful prevention depends on automatically detecting malicious messages. Automatic detection of bullying in the text of social media by analyzing the text "twits" via one of the machine l...

متن کامل

Emotion Detection in Persian Text; A Machine Learning Model

This study aimed to develop a computational model for recognition of emotion in Persian text as a supervised machine learning problem. We considered Pluthchik emotion model as supervised learning criteria and Support Vector Machine (SVM) as baseline classifier. We also used NRC lexicon and contextual features as training data and components of the model. One hundred selected texts including pol...

متن کامل

Presentation of an efficient automatic short answer grading model based on combination of pseudo relevance feedback and semantic relatedness measures

Automatic short answer grading (ASAG) is the automated process of assessing answers based on natural language using computation methods and machine learning algorithms. Development of large-scale smart education systems on one hand and the importance of assessment as a key factor in the learning process and its confronted challenges, on the other hand, have significantly increased the need for ...

متن کامل

Presentation of an efficient automatic short answer grading model based on combination of pseudo relevance feedback and semantic relatedness measures

Automatic short answer grading (ASAG) is the automated process of assessing answers based on natural language using computation methods and machine learning algorithms. Development of large-scale smart education systems on one hand and the importance of assessment as a key factor in the learning process and its confronted challenges, on the other hand, have significantly increased the need for ...

متن کامل

Stock Price Prediction using Machine Learning and Swarm Intelligence

Background and Objectives: Stock price prediction has become one of the interesting and also challenging topics for researchers in the past few years. Due to the non-linear nature of the time-series data of the stock prices, mathematical modeling approaches usually fail to yield acceptable results. Therefore, machine learning methods can be a promising solution to this problem. Methods: In this...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005